RESULT 2 

BOVANGA 

LOCUS 

DEFINE? ION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
MEDLINE 
FEATURES 

sou rce 



BASE COUNT 
ORIGIN 



BOVANGA 924 bp DNA MAM 25-NOV-1994 

Bos taurus (Aberdeen Angus strain) DNA fragment. 

L27712 

L27712.1 GI:443737 

Mitochondrion Bos taurus (strain Aberdeen Angus, organelle 
Mitochondrion Bos taurus) Male DNA. 
Mitochondrion Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 924) 

Loftus,R.T., MacHugh, D. E. , Bradley, D . G . , Sharp, P.M. and 
Cunningham, P . 

Evidence for two independent domestications of cattle 
Proc. Natl. Acad. Sci . U.S.A. 91 (7), 2757-2761 (1994) 
94195822 

Location/Qualifiers 
1. .924 

/organism="Bos taurus" 
/organelle= "mitochondrion" 
/strain="Aberdeen Angus" 
/db_xref="taxon: 9913" 
/sex="Male" 
303 a 224 c 127 g 270 t 



Query Match 100. OS; Score 158; DB 3; -Length 924 ; 

Best; Local Similarity 100.0%; Pred. No. 5.5e-24; 

Matches 158; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 gtcaatggtcacaggacataaattat)attatatatccccccttcataaaaattcccccct 60 

II M I I I II I I II I II I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | f | || | | | | | | | ! | 
Db 74 5 GTCAATGGTCACAGGACATAAATTATjATTATATAJTCCCCCCTTCATAAAAATTCCCCCCT 804 

Qy 61 taaatatctaccaccacttttaacagacttttccctagatacttatttaaatttttcacg 120 

MINI I II I I I I I I | | | | | | | | | | | | | | | If I II I I II I I I I I I I 1 I 1 I | | | | | 

Db 805 TAAATATCTACCACCACpTTTAACAGACjTTTTCCCTAGAT^CTTATTTAAATTTTTCACG 8 64 

Qy 121 ctttcaatactcaatttagcactccaaacaaagtcaat 158 

M I I I II I I I M I I I I I II I I I I I I I | | | | | | | | | | | | 
Db 8 65 CTTTCAATACTCAATTTAGCACTCCAAACAAAGTCAAT 902 
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addition, determination of the sizes and patterns of induction 
of the UNA transcripts was necessary to enable estimation of 
the redundancy of the clones. Hicreforc, Northern blot 
analysis was performed with UNA isolated from human 
lL-2U-positive T-eell blasts stimulated with either CIIX or 
IL-2 alone, or with a combination of the two agents. 

I lybridizalion of the UNA with probes generated from the 
inserts of each of the 18 putative clones resulted in the 
identification of 4 clones that were solely CHX-induced. For 
the remaining 14 clones, the induction by the combination of 
IL-2 and CIIX could not be accounted for by the effects of 
CIIX alone. 13ascd upon the patterns of induction and 
approximate sizes of the RNA transcripts, 8 readily distin- 
guishable and apparently unique IL-2-induced genes were 
discerned, as partial sequences, among these 14. These are 
described in Table X, 
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(kb) 


(bnses) 








Size of 


Size of 








Partial 


Partial 


IL-2 
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Nucleotide Sequence 


Sequence 


Sequence 


Induction 


OKI 


nucleotide 857 to 2406 of 


1.6 


2406 


24 


OAH) 


SIX) ID NO: 1 








CR2 


nucleotides 1 to 163 and 


l.l 


1283 


7 


(11*) 


nucleotides 1093 to 1283 
of SF.Q ID NO: 3 








CR3 


nucleotides 718 to 901 


2.0 


2450 


22 


(10AK) 


and nucleotides 2265 to 
2450 ofSHQIDNO:5 








CR4 


nucleotides 2101 to 2291 


1.0 


2946 


6 


(101)6) 


and nucleotides 2679 to 
2928 of SEO ID NO: 7 








CR5 


nuclcolide< 763 to 902 


1.4 


2020 


>50 


(10R) 


and nucleotides 1641 to 
2020 of SIX} II) NO: 9 








CKo 


nucleotides 310 to 513 


1.0 


1066 


5 


(11132) 


and nucleotides 687 to 








1066 of SfiQ ID NO: 11 








CR7 


corresponds to 


0.7 


2400 


17 


(lll.i6) 


nucleotides of pini-1 
sequence in Scltcn ct al. 








CR8 


nucleotides 1721 to 1915 


1.5 


2980 


7 


(1JK2) 


of SRO ID NO: 13 









'llic original designations of the CR clones are included in 
parentheses in the left-hand column of Table II. The original 
designations are used herein to refer to the partial sequences 
shown in the column second from the left in Tabic II. As 
shown in Table II and in FIGS. 8A-8H, three of the genes, 
CR 1, CR3, and CR5, were induced by IL-2 alone, while five 
of the genes, CR2, CR4, CR6, CR7, and CR8, were induced 
by both CIIX and IL-2. In several instances, the combination 
of IL-2 and CHX resulted in a marked synergistic induction. 

Example III 

Kinetic Analysis of IL-2-Induced Gene Expression 

ITic temporal expression of the novel, IL-2-induced genes 
was determined by Northern blot analysis, using RNA 
isolated from human IL-2R-positive T-ccll blasts after IL-2 
stimulation in the presence or absence of CHX. Northern 
blots were prepared with 15 mg total RNA isolated from 
G 0 /G r synchronized human T-cells stimulated for 0, 0.5, 1, 
2, 4, or 8 hours with 1 nM IL-2 or IL-2+10 mg/ml CIIX. 
Filters were probed with the cDNA inserts of the IL-2- 
induced clones. 

As shown in FIGS. 9A-9H, two of the genes, 1A8 (FIG. 
9A) and 10D6 (FIG. 913), exhibited rapid induction, reach- 
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ing peak levels within 1-4 hr of IL-2 stimulation and 
returning to basal levels after 8 hr, while the other six clones 
(FIGS. 9C-9II) remained at elevated levels for at least 8 hr 
after IL-2 treatment. The magnitude of IL-2 induction of 
5 steady state RNA levels of the clones ranged from an 
approximately 5-fold elevation of clone 1 1B2 (FIG. 9F) to 
a greater than 50-fold stimulation of clone 10F9 (FIG. 9E) 
during the interval examined. These results arc also sum- 
mari/xd in Table II. Several of the clones were superinduced 
10 by CHX, with an increase observed in both the magnitude 
and duration of the IL-2 response. 

The kinetics of induction of previously characterized 
IL-2- responsive genes have been found to range from those 
such as c-fos, which arc rapidly and transiently induced 
15 within minutes of IL-2 stimulation (Dautry ct al. (1 988) J. 
Biol. Chem. 263:17615-17620), to those which remain at 
elevated levels through G ( to S phase entry (S a bath el al. 
(1990) J. Biol Chem. 265:12671-12678). 

20 Example IV 

Sequence Analysis of Clones Containing Ligand- 
Induced Genes 

To verify the redundancy of the clones as estimated from 
25 Northern analysis, as well as to determine the identities of 
the genes, the cDNA clones were subjected to sequence 
analysis. 

PI asm ids were isolated from the clones of interest essen- 

3(J tially as described by Kraft et a I. ((1988) Biotechniques 
6:544—547), and vector primers were used to sequence the 
termini of the cDNA inserts, employing the Scqucnasc 2.0 
didcoxy sequencing kit (United States Biochemical, 
Cleveland, Ohio). Approximately 200 bases of sequence 

35 were attained from each end of the inserts. 'Hi esc partial 
sequences are described in Table II. Searches of the Gcn- 
Bank and EMBLdata bases were performed with the FASTA 
program as described by Pearson ct al. ((1988) Proc. Natl. 
Acad. Set. (USA) 85:2444-2448). 

The combination of sequence and Northern analyses 
revealed that the 14 putative I L-|- induced clones consisted 
of 8 unique genes, three of which, 1A8, UB2, and 13E2, 
were isolated three times each. Searches of the GcnBankand 
EMBL data bases with the partial sequences enabled the 

45 identification of one clone, 11E6, as pim-1, a previously 
characterized IL-2-induced gene (Dautry ct al. (1988) J. 
Biol. Chem. 263:17615-17620; and Kakut-Houri et al. 
(1987) Gene 54:105-111) which encodes a 33 kD cytoplas- 
mic kinase (Telcrman et al. (1988) Mot. Cell. Biol. 

50 8:1498-1503). 

Thus, by utilizing the method of the invention seven 
unique IL-2 induced genes were cloned, representing novel 
human genes. These clones were identified after screening 
only approximately 800 library colonies, and thus, it is 

55 estimated that as many as 80 additional novel IL-2-induced 
genes remain to be detected in the 10,000-clone library. 

To determine the complete sequences of these clones 
described in Table II the original partial cDNAs were used 
as probes to screen a second cDNA library. It is standard 

60 procedure to use partial cDNA inserts identified by an initial 
screen of a cDNA library to make radiolabeled cDNA probes 
to screen a second library to obtain clones with the portions 
missing in the initial cDNA clones. This was done, briefly, 
as follows: a second cDNA library was prepared from 

65 mRNA obtained from human T cells stimulated for two 
hours with interleukin-2 in the presence of cycloheximide by 
cloning into the Xgt-10 phage vector using standard meth- 
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DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
MEDLINE 
FEATURES 

source 



BASE COUNT 
ORIGIN 



BOVHERB 893 bp DNA MAM 25-NOV-1994 

Bos taurus (Hereford strain) DNA fragment. 

L27725 

L27725.1 GI:443748 

Mitochondrion Bos taurus (strain Hereford, organelle Mitochondrion 
Bos taurus) male DNA. 
Mitochondrion Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 893) 

Loftus,R.T., MacHugh, D . E . , Bradley, D . G . , Sharp, P.M. and 
Cunningham, P . 

Evidence for two independent domestications of cattle 
Proc. Natl. Acad. Sci. U.S.A. 91 (7), 2757-2761 (1994) 

94 195822 

Location /Qualifiers 
1. .893 

/organism="Bos taurus" 
/organelle= "mitochondrion" 
/strains" Hereford" 
/db_xref="taxon: 9913" 
/sex="male" 
292 a 216 c 125 g 260 t 



Query Match 100.0%; Score 265; DB 3; Length 893; 

Best Local Similarity 100.0%; Pred. No. l.le-71; 

Matches 265; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 atccaatgaattttaccaggcatctggttctttcttcagggccatctcatctaaaacggt 60 

M I I I I I I U I I I I I I I I I I I I I I I II I I I I I II I I I I II I I I I I I I | | | | | I | 

Db 44 1 ATCCAATGASkTTTTACCAGGCATCTGGTTCTTTCTTCAGGGCCATCTCATCTAAAACGGT 500 

Qy 61 ccattctttcctcttaaataagacatctcgatggactaatggctaatcagcccatgctca 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I 
Db 501 CCATTCTTTCCTCTTAAATAAGACATCTCGATGGACTAATGGCTAATCAGCCCATGCTCA 5 60 

Qy 121 cacataactgtgctgtcatacatttggtatttttttattttgggggatgcttggactcag 180 

I I I I M I I I I I I I I M I I I I I I I I I I I M I I I I I I II I I II I I I II I I II 

Db 5 61 CACATAACTGTGCTGTCATACATTTGGTATTTTTTTATTTTGGGGGATGCTTGGACTCAG 620 

Qy 181 ctatggccgtcaaaggccctgacccggagcatctattgtagctggacttaactgcatctt 240 

I I I 1 I I I I I M I 1 I I I I I I I I I I I I I I I I I | | | | | | | | | M | | | M | | | | | | | | | | | | | | 
Db 621 CTATGGCCGTCAAAGGCCCTGACCCGGAGCATCTATTGTAGCTGGACTTAACTGCATCTT 680 

Qy 241 gagcaccagcataatgataagcatg 265 

I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 681 GAGCACCAGCATAATGATAAGCATG 705 
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ods. (Sambrook, J. ct al. Molecular Cloning: A Laboratory 
Manual (Cold Spring Harbor Laboratory Press, New York, 
1989) pp. 2.82-2.122). 

Ill is second cDNA library was (hen screened using as 
probes each of the cDNA fragments obtained from the first, 
thiol-sclected cDNA library. Candidate clones that corre- 
sponded to the correct size according to the mKNA were 
then subcloncd and sequenced. The complete cDNA 
sequences (and the predicted amino acid sequences) of 
seven out of eight of these clones are set forth in SEQ ID 
NOs:l-t4 and FIGS. .1-7. Hie complete cDNA sequence 
(and the predicted amino acid sequence) of the eighth clone 
was determined to be identical to that of the IL-2 induced 
gene pim-L. Hie nucleotide sequence as well as the pre- 
dicted amino acid sequence of pin-1 arc set forth at page 605 
in Sclten, G. ct al. (1.986) Cell 46:603-61 1. 

Example V 

Determination of Sensitivity of IL-2-Induced Gene 
Expression 

As a further means of characterizing the regulation of 
expression of these genes, the sensitivity of induction to the 
known IL-2 functional antagonist was investigated. Human 
IL-2R-positiveT-ceII blasts were stimulated with IL-2 in the 
absence or presence of 0.5 mM dibutyryl-cAMP, a concen- 
tration of the mcmbrane-pcrmcant cAMP analog sufficient 
to inhibit IL-2- mediated GI progression without adversely 
a fleeting cellular viability. 'Hie effect of an equivalent molar 
amount of sodium butyratc, which docs not inhibit the IL-2 
res|xinse, was also tested to control for the actions of free 
butyric acid. 

Northern blots were prepared as follows: Human IL-2R- 
posilive T cells were treated with 1 nM IL-2 alone or in 
combination with 0.5 mM dibutyryl cAMP or sodium 
butyratc (NaBt) for I, 2, or 4 hours. Filters were prepared 
with 15 mg total UNA and hybridized with cDNA inserts or 
die IL-2 induced clones. 

Iliese analyses demonstrate that the IL-2 induction of one 
gene, 1A8 (FIG. 10A) is markedly inhibited when the 
intracellular level of cAMP is raised by the addition of 
dibutyryl cAMP, whereas the expression of two others, 
10D6 (FIG. 10B) and 13E2 (FIG. 10C), is augmented 
approximately 3 -fold. Dy comparison, the expression of five 
of the genes was not affected by elevated cAMP (FIGS. 
10D-10H). Thus, the sequences in clone 1A8 may be 
involved in T-cell proliferation. The fact that not all genes 
were sensitive to cAMP indicated that the observed results 
were not due to nonspecific effects, and furthermore that the 
previously documented down-regulation of IL-2R binding 
capacity by cAMP (Johnson ct al. (1990) J. Immunol. 
145:1144—1151) could not account for the inhibition of gene 
expression. 

Example VI 

Determination of Role of T-cell Receptor 
Activation in the Stimulation of Expression of IL- 
2-Induccd Genes 

In order to determine if activation of the T-cell receptor 
mediates the stimulation of expression of cytokine IL-2- 
induced genes, the following study was performed. Northern 
blots were prepared from 20 mg total cellular RNA isolated 
from human peripheral blood mononuclear cells (PBMCs) 
stimulated with a monoclonal antibody (OKT3) specific to 
the CD3 component of the T-cell antigen receptor complex. 
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Blots were probed with cDNA inserts of the IL-2-indueed 
clones. Data was determined as the mcan^SEM (n«6). 

By isolation of RNA at early time intervals, it was 
possible to identify those genes which were induced by 

5 T-cell receptor triggering in the absence of IL-2 effects. As 
shown in FIGS. 11 A— 111 I, only one of the genes, 10D6 
exhibited heightened levels of expression after 2 lir of T-cell 
receptor activation, while the seven others were apparently 
insensitive to this stimulus. Two of the clones, 1F5 and 11 

J0 B2, were undetectable, even after seven days of autoradio- 
graphic exposure of the Northern blots. T\vo other genes, 
11E6 and 13E2, were expressed at relatively high levels 
regardless of the stimulus; activation with anti-CD3 did not 
induce RNA expression beyond the level observed by cul- 

15 turc in medium alone. Identical results were obtained after 
1 and 4 hr of stimulation. 

To determine whether the cells were actually activated via 
CD3, aliquots of the cells were left in culture for 52 hr in the 
presence of 10 rng/mi CI IX, alone, OKT3 alone, or OKT3+ 

20 CHX, after which cell cycle progression was monitored by 
[ 3 H]-thymidinc incorporation into RNA. 

As shown in FIG. 12, the cells were sufficiently stimulated 
by anli-CD3. Thus, the T-cell receptor- induced expression of 

^ only one of the genes was comparable to that seen with IL-2 
stimulation, while the expression of the seven others was 
unique to the IL-2 signaling pathway. Thus, the methods 
described herein to identify IL-2- induced gene successfully 
selected and enriched for these genes that are highly specilic 

^ 0 for cytokine (IL-2) activation. 

Of the 8 IL-2 induced G, progression genes reported here, 
only one appears to also be induced during the T cell 
receptor-mediated competence phase of the cell cycle. Thus, 
while several genes such as c-fos, c-myc and c-raf-l arc 

35 known to be induced during both the initial Gq-Gj and 
subsequent Gj-S phase transitions, the expression of a 
number of IL-2-stimulatcd genes is unique to the latter 
event. In addition, the immediate-early genes reported here 
appear to define a class distinct from the IL-2-induced genes 

40 isolated by Sabath ct al. ((1990) J, HioL Cliem. 
265:12671-12678). These investigators utilized a differen- 
tial screening procedure to isolate genes expressed al the 
Gj/S phase boundary in a murine T helper clone which was 
stimulated with IL-2 for 20 hr in the absence of protein 

45 synthesis inhibitors. In this case, the expression of only 3 of 
the 21 clones isolated was inhibited by CHX, while the 
remainder were insensitive to this agent. Ill is pattern of 
regulation markedly contrasts with the CHX supcrinduction 
observed with the immediate-early IL-2-induccd genes 

50 described here. Moreover, these observations indicate that 
IL-2 stimulates a complex program of gene expression, 
ranging from those genes induced very early in G^ through 
those subsequently expressed at the GJS phase transition. 

55 Example VII 

Cloning and Analysis of CR8 

As described above, the CR8 gene encodes a novel basic 
helix-loop-helix (bHLH) protein. While the CR8 transcript 

60 is ubiquitously expressed in many tissues, it is induced by 
IL-2 as well as by IL-3 in cytokine-dependent lymphoid cell 
lines. In an IL-2-dependent human T cell line Kit 225, the 
CR8 transcript is induced not only by IL-2, but also by 
interferon b and forskolin, which elevates intracellular 

65 cAMP. The bHLH domain of CR8 shows the highest struc- 
tural homology to a Drosophila transcriptional repressor 
hairy. The recombinant CR8 protein binds preferentially to 
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BOVHERB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
MEDLINE 
FEATURES 

.sou rc 



Sty ID 9> 



BOVHER 
Bos t.t 
L2772:^ 
L2772:' 

Mitoch 
Bos t.i 
Mitoch 
Eukar - 
Mamma 
Bovid. : 
1 (b, ; 
Loftu:- 
Cunni: . 
Evidei i 
Proc. 
94195. 



B 893 bp DNA MAM 

irus (Hereford strain) DNA fragment. 



25-NOV-1994 



BASE COUNT 
ORIGIN 



.1 GI:443748 

>ndrion Bos taurus (strain Hereford, organelle Mitochondrion 
irus) male DNA. 
>ndrion Bos taurus 

)ta; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
.a; Eutherin; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 
/ Bovinae; Bos. 

es 1 to 8 r '3) 

■R.T., MacHugh, D . E . , Bradley, D . G . , Sharp, P.M. and" 
fham, P . 

:e.for two independent domestications of cattle 
latl. Acad. Sci. U.S.A. 91 (7), 2757-2761 (1994) 

:2 

Location /Qualifiers 
1. .893 

/organ! :n="Bos taurus" 
/ organo ;.e="mi tochondrion" 
/strain "Hereford" 
/db_xre!"-"taxon: 9913" 
/sex="m . le" 
a 216 c 125 g 260 t 



Query Match 
Best; Local Simi : 
Matches 4 29; 

gcatatd . 
I I I I i M 
GCATATAA 



ca tagta 
I I I I I M ' 
CATAGTA' 

agatcac' 
I I I M I I < 



tcttcto; 
i I I I I II 
TCTTCTC': 

ctggttc 
M I I I M 
ctggttc:" 



Qy 


1 


Db 


225 


Qy 


61 


Db 


285 


Qy 


121 


Db 


345 


Qy 


181 


Db 


404 


Qy 


241 


Db 


464 


Qy 


301 


Db 


524 



93.0%; Score 448.2; DB 3; 
irity 89.)%; Pred. No. 1.9e-92; 
-nservativ 51; Mismatches 1; 



Length 893; 



Indels 



1; Gaps 



1; 



jcaagyacat 'ayyyc taya vyagtacataayrcatayaat tattrayygta 60 

1 M I : I I I I : I : : : I I I : I : : I f | | | i | i | : : | | | | ; |:|::IU 

iCAAGT ACAT^ ACCTCTAT AGCAGTACATAATACAT AT AATT ATTGACTGTA 2 8 4 

ittatrtcaa i yycatyctyrayaryatatytayyatatayyycytnccayt 120 

1111 = 1111 ::lll:ll::|:|::||||:||::|||||:::|:| |||:| 

VTTATGTCAA TTCATTCTTGATAGTATATCTATTATATATTCCTTACCATT 34 4 



igcttaayta^catgccgcgtgaaaccarcaacccgctrrgcagnggatccc 
N i I [ I : I 1 I I I I I 1 I I I I I I I I i I i : I I I I I I j | | : : i | | | ||||M| 



:tccgggcccntarayygtgggggtcgctatyyartgaaytttaycaggcat 
1 I I I I I I I I I ! I : I - I I I I I I I I I I I I I I :: I : I I I I : | | | | : | | | | | | | 
:TCCGGGCCC TAAACCGTGGGGGTCGCTATCCAATGAATTTTACCAGGCAT 



180 



240 



463 



■tcttcaggg^catctcatctaaaryrgtccattcyttcctcttaaataaga 300 
M I I I I I I I 1 I I I I I 1 I I I I ! I ::: I I I I I I | | : | | | | | | | I | | | I | | M 
JTCTTCAGGC :ATCTCATCTAAAACGGTCCATTCTTTCCTCTTAAATAAGA 523 



catctcg.v:ggactaatgrctaatcagcccatgctcacacataactgtgytgtcatacat 360 

1 M I I I i I I I : I I II I I I I I I I I | I | | | I | | | j | | | | | | | | ; | | | | | | | | || 

CATCTCG/ 1 GGACTAATC :TAATCAGCCCATGCTCACACATAACTGTGCTGTCATACAT 58 3 



r 



Qy 

Db 

Qy 

Db 



361 ttggtati 
I I I I I I I : 
58 4 TTGGTAT'! 



ttttattttnggggatgcttggactcagctatggccgtcaaaggccctgac 4 20 
I I I I i I I I M I I I I I I I II I I [ I I I I II I I I i I i II t I I I I I I I I I I I I i I 
'TTTTATTTTGGGGGATGCTTGGACTCAGCTATGGCCGTCAAAGGCCCTGAC .64 3 



421 ccggagcci .eta ttgtagctggacttaactgcatct tgagcaccagcataa tga targcr 480 

I I I I I I I I I 1 I I I I I I II I I I I I I I II I I I I I II I I I i | M I I I I I I I I I I I i i I : I I : 
64 4 CCGGAGCA XTATTGTAGCTGGACTTAACTGCATCTTGAGCACCAGCATAATGATAAGCA 703 



Qy 481 tg 482 
I I 

Db 704 TG 705 



i 



c 
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REFERENCE 
AUTHORS 

TITLE 
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FEATURES 

source 



fl|y iV. f 5" £ #31 uw. .Ui<u< 



BOVWFUA 8 92 bp DNA ^ '*^MAm' 

Bos indicus (White Fulani strain) DNA fragment. 

L27720 

■L27720. 1 GI : 443758 



2 5 -NOV- 1994 



BASE OVNT 
ORIGIN 



Mitochondrion Bos indicus (strain White Fulani, organelle 
Mitochondrion Bos indicus) female DNA. 
Mitochondrion Bos primigenius indicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 892) 

Loftus, R.T. , MacHugh, D. E. , Bradley, D . G . , Sharp, P.M. and 
Cunningham, P . 

Evidence for two independent domestications of cattle 
Proc. Natl. Acad. Sci. U.S.A. 91 (7), 2757-2761 (1994) 
94195822 

Location /Qualifiers 
1. .892 

/organism="Bos primigenius indicus" 
/organelle^ "mitochondrion" 
/strain="White Fulani" 
/db_xref="taxon: 9915" 
/sex="female" 
291 a 216 c 126 g 259 t 



Query Match 93.0%; Score 448.2; DB 3; Length 892; 

Best; Local Similarity 89.0%; Pred. No. 1.9e-92; 

Matches 429; Conservative 51; Mismatches 1; Indels 1; Gaps 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



1 gcatataagcaagyacatrayyyctayavyagtacataayrcatayaattattrayygta 60 
I I I I i I I I I I 1 I I : I I I I : I : : : I I I : I : : I I I I I I I i I : : I t I I : I I I I I I I : I : : I I I 
2 2 5 GCATATAAGCAAGTACATGACTTCTATAGCAGTACATAATACATATAATTATTGACTGTA 28 4 



61 



V 



120 



catagtacattatrtcaaayycatyctyrayaryatatytayyatatayyycytnccayt' 
I I I I I I I M i I I I : I I 1 I I : : I I! : I I : : I : I : : I I It : ! I : : I t ! I I : : : I : I M I : I 
2 8 5 CATAGTACATTATGTCAAATTCATCCTTGATAGTATATCTATTATATATTCCfTTACCATT 34 4 

12 1 'agatcacgagcttaaytaccatgccgcgtgaaaccarcaacccgctrrgcagnggatccc 180 

I I I I I I I i I I I I I I I : I I I I I I I I I I I ! I I I I i I I I : I I I I I I I I I : : t I I I I II I I I I 
34 5 AGATCACGAGCTTAATTACCATGCCGCGTGAAACCAGCAACCCGCTAGGCAG-GGATCCC 4 03 

I'-] 1 tcttctcgctccgggcccatarayygtgggggtcgctatyyartgaaytttaycaggcat 
I I if I I I I I I I I I I I I I I I I I : I : : I I I I I t I I I I I I I I : : I : I I II : I I I I : II I I I I I 
4 0 4 TCTTCTCGCTCCGGGCCCATAAACCGTGGGGGTCGCTATCCAATpAACTTTACCAGGCAT 

241 ctggttctttcttcagggccatctcatctaaaryrgtccattcyttcctcttaaataaga 
I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I ::: I I I I I I I I : I I I I I I M I M I I I II 
4 64 CTGGTTCTTTCTTCAGGGCCATCTCATCTAAAACGGTCCATTCTTTCCTCTTAAATAAGA 



240 



463 



300 



523 



Qy 

Db 



301 catctcgatggactaatgrctaatcagcccatgctcacacataactgtgytgtcatacat 360 

I I I i I I I I t I I I I I I I I I : I I I I I I I I I I I I I I I I I II I I I I I | I | | | | : I M I I I I I I I 
52 4 CATCTCGATGGACTAATGGCTAATCAGCCCATGCTCACACATAACTGTGCTGTCATACAT 583 



6,o: 

49 

This result is consistent with the predictions from ONA- 
bllUI protein eo-crystaLs (Fcrr-IVAmar, A. R. ct al. (1994) 
UMBO J. 13:180-189; l ; err-0*Amar, A. \L et al. (1993) 
Nature 363:38-45). However, it is noteworthy in thai CR8 
is the first bllLII vertebrate protein without a leucine zipper 
(LZ) motif found to bind Class B sites. Protein diraerization 
is more selective than DNA binding, but currently no rules 
are available that predict the dimerization preference of any 
given HLH proteins. Even so, a Class A-binding protein 
seems to form DNA binding helcrodimers only with other 
Class A proteins, and a bllLH protein with a LZ docs not 
form hctcrodimers with those without LZs (Blackwood, E. 
M. ct al. (\ 99\) Science 251:1211-1217; Rrendcrgast, G. C. 
et al. (1991) Call 65:395-407). Therefore, if CR8 does form 
hctcrodimers, the most likely partner is a class B-binding 
bllLII protein without a VZ. 

Although CR8 is most homologous to hairy in its bllLH 
domain, its preference for Class B E-box binding sites rather 
than class C sites, and its lack of a C-terminal WPRW motif, 
clearly sets CR8 apart and docs not predict necessarily that 



17,914 

50 

CR8 may act as a transcriptional repressor as do hairy- 
related proteins. Recently, Id proteins that lack a basic region 
have been shown to favor proliferation, presumably by 
forming hctcrodimers with differentiation inducing bllLII 

5 proteins, thereby preventing DNA binding and transcrip- 
tional activation of genes that program differentiation 
(Baronc, M. V. et al. (L994) Proc. Natl. Acad Sci USA 
91:4985-4988; Mara, E. et al. (1994) 7. Biol. Cham. 
269:2139-2145; lavarone, A. et al. (1994) Genes & Dev 
1270-1284). Therefore, CR8 could promote proliferation by 
suppressing differentiation by cither of these transcriptional 
repressor mechanisms. Alternatively, CR8 could also acti- 
vate transcription like the bHLII-LZ Myc family. 

j5 Equivalents 

Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, many 
equivalents of the specific embodiments of the invention 
described herein. Such equivalents arc intended to be 
encompassed by the following claims. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES! 35 



( 2 ) INFORMATION FOR SEQ ID NO 1 1 : 

<i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 2406 base pairs 

( B) TYPE: nucleic acid 

(C) STRAND EDNESS : oinyle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 116.. 722 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 



AACCCAACCG CAGTTGACTA GCACCTGCTA CCGCGCCTTT GCTTCCTGGC GCACGCGGAG 60 

CCTCCTGGAG CCTCCCACCA TCCTCCCTAC TACCTGCTGC CCTCCGCCCC CAGCC ATG LIB 

TGC CGC ACC CTG GCC GCC TTC CCC ACC ACC TGC CTG GAG AGA GCC AAA 166 

GAG TTC AAG ACA CGT CTG GGG ATC TTT CTT CAC AAA TCA GAG CTG GGC 214 

TGC GAT ACT GGG AGT ACT GGC AAG TTC GAG TGG GGC AGT AAA CAC AGC 262 

AAA GAG AAT AGA AAC TTC TCA GAA GAT GTG CTG GGG TGG AGA GAG TCG 310 

TTC GAC CTG CTG CTG AGC AGT AAA AAT GGA GTG GCT GCC TTC CAC GCT 358 

TTC CTG AAG ACA GAG TTC AGT GAG GAG AAC CTG GAG TTC TGG CTG GCC 4 06 

TGT GAG GAG TTC AAG AAG ATC CGA TCA GCT ACC AAG CTG GCC TCC AGG 454 

GCA CAC CAG ATC TTT GAG GAG TTC ATT TGC AGT GAG GCC CCT AAA GAG 502 

GTC AAC ATT GAC CAT GAG ACC CGC GAG CTG ACG AGG ATG AAC CTG CAG 550 

ACT GCC ACA GCC ACA TGC TTT GAT GCG GCT CAG GGG AAG ACA CGT ACC 598 

CTG ATG GAG AAG GAC TCC TAC CCA CGC TTC CTG AAG TCG CCT GCT TAC 646 

CGG GAC CTG GCT GCC CAA GCC TCA GCC GCC TCT GCC ACT CTG TCC AGC 694 

TGC AGC CTG GAC CAG CCC TCA CAC ACC T GAGTCTCCAC GGCAGTGAGG 7 42 



r 



Qy 3 til ttggtatttttttattttgggggatgcttggactcagctatggccgtcaaaggccctgac 4 20 

M i i ! I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I | M II I I I I I I I I I I I I II I it 

Db 5H4 TTGGTATTTTTTTATTTTGGGGGATGCTTGGACTCAGCTATGGCCGTCAAAGGCCCTGAC 64 3 

Qy 421 ccggagcatctattgtagctggacttaactgcatcttgagcaccagcataatgatargcr 480 

I I I I I I I I I I I I I I I I II I I I I II I I I I I I II I I M I I I I M I I II I I I II : I I : 

Db 64 4 CCGGAGCATCTATTGTAGCTGGACTTAACTGCATCTTGAGCACCAGCATAATGATAAGCG 703 

Qy 431 tg 4 82 
I I 

Db 704 TG 705 



r 
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such as MyoD and AS-C gene products show a cell-type competitor. FIG. 17B shows competition of the binding of 

specific expression, others such as El 2/E47 and da arc fairly CR8 bHLH to the Class B sites. 0.5 ng of the radiolabeled 

ubiquitously expressed. The tissue distribution of CR8 was Class H probe was incubated with CR8 bl ILI I in the absence 

analyzed using a Multiple 'I Issue Northern blot. CR8 Iran- (i anc |.) or thc prescn ce (lanes 2 to 7) of either 25 ng (50-fold 
scripts ol the expected size (3.2 kb) were detected in all 5 CXCCSH . lancs 2 , 4 and 6) and 250 ng (500-fold excess; lanes 

tissues examined except placenta (sec FIG. 16) FIG. 16 3> 5 and 7) ofunlabclcd competitors. FIG. 17B demonstrates 

clcmonstralcs that thc Multiple I issue Northern Mloi mem- tha| thc bM of CRR bm „ [Q (hc ra(liolahclc(l Class „ 

branes (Clontcch; each Jane contains 2 Jg poly(A)+ UNA si(c can bc M { $MM ^ 

CR8 Ibc " C> V W ~««P>«te>y b y « 500-fold excess of Class B site (lanes 2 and 

The expression of CR8 in peripheral blood leukocytes 10 ^ ^ * ° f , Cl«ss C site only partially 

was unexpected, in that CR8 is not expressed by quiescent dls P laccd CR8 bHLH from thc labeled Class D probe (lanes 
T cells. This may reflect much higher sensitivity of Multiple 4 and 5 ) and thc N " box sc Q uenct; did not affccl lhc bindin g 
Tissue Northern blot prepared from poly(A)+ RNA com- at a11 ( ,ancs 6 and 7 )- since a11 ,hcsc experiments were 
pared to our previous Northern blots, which used total RNA. donc in thc absence of other I I LI I proteins, it appears that 
Alternatively, thc contribution of other leukocytes such as 13 15 CR8 bHLH bound to the Class 13 sequence as a homodimcr 
cells, NK cells, monocytes and granulocytes that were not the highest affinity. 

present in the original T cell preparations could account for The CR8 gene encodes a novel bHLH protein that appears 
CR8 expression by thc peripheral blood leukocytes. to fit into a class by itself. Other than c-myc, CR8 is thc first 

DNA-binding activity of CR8: bllLH-containing protein found to be induced by cytokines. 

r Hie canonical bHLH binding sequence is called thc 20 Also, from its predicted amino acid sequence, CR8 clearly 

E-box, CANNTG, originally identified in thc immunoglo- contains a bHLH motif most closely related to the hairy 
bulin heavy chain enhancer (Ephrussi, A. ct al. (1985) family, but the amino acid sequence of the basic region 

Science 227:134-140). Many bHLH proteins were later difiTcrs from other hairy-related proteins: the position of the 

divided into two mutually exclusive classes, depending on proline residue is N-terminal to the defining proline of the 

whether they bind to thc Class A sites (CAGCTG/CACCTG) 25 hairy-rciated proteins, and CR8 lacks the C-lcrminai WRPW 

or the Class B sites (CACGTG/CATGTG) (Dang, C. V. ct al. sequence found in all other hairy-rclated-relatcd protcias. 
( 1992) /'roc. Natl. Acad. Sci. USA 89:599602). Hie presence These differences in the amino acid sequence, especially of 

of an arginine residue al position 13 0*R13", sec FIG. ISA) thc basic region, most likely account for the unique binding 

in the basic region, which CR8 contains, is considered to be specificity of the CR8 bHLH domain. Instead of preferring 

the key structural criterion that defines Class 13 binding 30 Class C sites according to thc other hairy-related family 

specificity. However, despite the presence of "R 13", hairy- members (Ohsako, S. et al. (1994) Genes & Dev 

related bHLH proteins are reported to prefer noncanonical 8:2743-275572; Van Dorcn, M. et al. (1994) Genes & Dev 

binding sites such as thc N-box (CACNAG) (Akazawa, C 8:2729-2742), CR8 binds preferentially to Class 13 sites, 
et al. (1992)7. Bioi Chenu 267:21879-21885; Sasai.Y. et al. Thc identification of CR8 as a bHLH protein, thereby 

(1992) Genes & Dev 6:2620-2634; Tietzc, K. et al. (1992) 35 functioning, most likely, as a regulator of subsequent gene 

Proc. Natl Acad. Sci. USA 89:6152-6156) or thc Class C expression stimulated by IL-2, provides a link between the 

(CACGCG) sites (Ohsako, S. ct al (1994) Genes & Dev immediate biochemical events triggered by cytokine reccp- 

8:2743-2755; Van Dorcn, M, et al. (1994) Genes & Dev tors and thc subsequent events of proliferation and/or dif- 

8:2729-2742.). Therefore, thc binding of CR8 to ail of these fcrcnliation. Thus far, IL-2 has been found to activate thc 

sites was tested. 40 serine/threonine kinase proto-oncogene Raf-1 (Turner, B. ct 

Since it is well documented that the bHLH domain is al. (1991) Proc. Natl. Acad. Sci. USA 88:1227-1231; 

sufficient to determine its ONA binding specificity Zmuidzinas, A. ct al. (1991) Moi. Cell Biol. 11:2794-2803) 

(Pognoncc, P. et a I. (1994) Mol Cell ttiol. 11:5125-5136), and thc tyrosinc-spccific kinases JAK 1 and JAK 3 

the bHLH domain of CR8 (CR8 bHLH) expressed in E. coli (Bcadling, C. et al. (1994) EMBO J. 13:5605-5615; 

was employed for this study, A histidine-lag was added to 45 Miyazaki, T. ct al. (1994) Science 266:1045-1047; Russell, 

facilitate the purification of the recombinant protein. While S. M. ct al. (1994) Science 266:1042-1045). 
most of thc recombinant protein localized in inclusion From thc results described herein comparing the effects of 

bodies, there was still enough soluble protein in the IFNa and forskolin on CR8 and c-myc gene expression, the 

cytoplasm, thereby enabling its purification under native regulation of these two bHLH genes is clearly distinct. It is 

conditions using a Ni 2+ column. Asingle band of protein was 50 also of interest that although IFNfJ antagonizes IL-2- 

detcctcd at the expected size (16.6 kD with thc histidine-tag) promoted cell cycle progression, it promotes the expression 

by silver staining. IZMSA was carried out using this rccom- of both CR8 and c-myc. Indeed, induction of c-myc by IFN(3 

bin ant protein. was unexpected, as it was previously reported to be sup- 

FIG. 17A is an EMSA shows binding of recombinant pressed by IFNs (Einat, M. et al. (1985) Nature 

bHLH proteins to the radiolabeled probes. CR8 bHLH 55 313:597-600). The bHLH region of CR8 is most bomolo- 

protein strongly binds to the Class B (CACGTG, lane 3) and gous to that of hairy and the bHLH proteins of thc E(spl)-C. 

the Class C (CACGCG, lane 4) sites, and weakly to the N In Drosophila, the hairy-related bHLH proteins function as 

box (CACNAG, lane 5) sequence but not to the Class A transcriptional repressors, and this activity requires the basic 

(CACCTG, lane 2) site. Binding of the bHLH region of da DNA binding region, as well as the interaction with a 

protein to the Class A site is shown as control (lane 1). As <so non-HLII protein termed groucho (gro) via the C-terminal 

shown in FIG. 17 A, CR8 bHLH bound to the Class B and WRPW motif (Paroush, Z. et al. (1994) Cell 79:805-315). 

the Class C probes, but only weakly to the N-box probe, and Although mammalian homologues of gro have been identi- 

not at all to the Class A probe. However, the control da fied (Stifani, S. et al. (1992) Nat. Genet, 2:119-127), they 

bHLH protein effectively recognized and bound to the same are not likely to interact with CR8 because CR8 lacks the 

Class A probe. 65 WRPW motif. 

To examine thc relative binding afilnity, a large excess of Thc results described herein indicate that CR8 recomni- 

non-labeled oligonucleotide was added to the reaction as nant protein binds to Class B E-box sites as a homodimer. 
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RESULT 4 

BOVHERB/c 

LOCUS 

DEFINITION 
ACCESS I ON 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

- TETLE 
> .'< )[.' RNAL 
MEC L [ME 

EE A CUR ES 

source 



ba:-'-; count 
or ; ri n 



BOVHERB 893 bp DNA MAM 25-NOV-1994 

Bos taurus (Hereford strain) DNA fragment. 

L27725 

L27725. 1 GI:443748 

Mitochondrion Bos taurus (strain Hereford, organelle Mitochondrion 
Bos taurus) male DNA. 
Mi tochondrion Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 
Bovidae; Bovinae; Bos. 
1 (bases 1 to 893) 

Loftus, R.T. , MacHugh, D . E . , Bradley, D . G . , Sharp, P.M. and 
Cunningham, P. 

Evidence for two independent domestications of cattle 
Proc. Natl. Acad. Sci . U.S.A. 91 (7), 2757-2761 (1994) 

94L95822 

Loca t ion/Qua li f iers 
1. .893 

/ organism="Bos taurus" 
/organelle= "mi tochondrion" 
/strain="Hereford" 
/db__xref="taxon: 9913" 
/sex="male" 
292 a 216 c 125 g 260 t 



Query Match 100.0%; Score 43; DB 3; Length 893; 

B*2st Local Similarity 100.0%; Pred. No. 6.3e-06; 

Matches 43; Conservative 0; Mismatches 0; Indels ■ 0; Gaps 0; 

Qy 1 taa tgtcca tgcttatcattatgctggtgctcaagatgcagtt 4 3 

I I II "l I M I I I M I I i I I I I I I I I I I I I i I I I I I I I I I I I I i I 
Db 712 TAATGTCCATGCTTATCATTATGCTGGTGCTCAAGATGCAGTT 67 0 



r 
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-continued 



TAG CTG CAG CTG TAC GCC ACC CTG CTG CTG CTT CTC ATT GTC TCG GTG 861 

CTC GCC TGC AAC TTC AGT GTC ATT CTC AAC CTC ATC CGC ATG CAC CGC 909 

CGA AGC CGG AG A AGC CGC TGC GGA CCT TCC CTG GGC AGT GGC CGG GGC 95 7 

GGC CCC GGG GCC CGC AGG AGA GGG GAA AGG GTG TCC ATG GCG GAG GAG 1005 

ACG GAC CAC CTC ATT CTC CTG GCT ATC ATG ACC ATC ACC TTC GCC GTC 1053 

TGC TCC TTG CCT TTC ACG ATT TTT GCA TAT ATG AAT GAA ACC TCT TCC 1101 

CGA AAG GAA AAA TGG GAC CTC CAA GCT CTT AGG TTT TTA TCA ATT AAT 114 9 

TCA ATA ATT GAC CCT TGG GTC TTT GCC ATC CTT AGG CCT CCT GTT CTG 1197 

AGA CTA ATG CGT TCA GTC CTC TGT TGT CGG ATT TCA TTA AGA ACA CAA 124 5 

GAT GCA ACA CAA ACT TCC TGT TCT ACA CAG TCA GAT GCC AGT AAA CAG 129 J 

GCT GAC CTT T GAGGTCAGTA GTTTAAAAGT TCTTAGTTAT ATAGCATCTG 134 3 

GAAGATCATT TTGAAATTGT TCCTTGGAGA AATGAAAACA GTGTGTAAAC AAAATGAAGC 1403 

TGCCCTAATA AAAAGGAGTA TACAAACATT TAAGCTGTGG TCAAGGCTAC AGATGTGCTG 1463 

ACAAGGCACT TCATGTAAAG TGTCAGAAGG AGCTACAAAA CCTACCCTCA GTGAGCATGG 1523 

TACTTGGCCT TTGGAGGAAC AATCGGCTGC ATTGAAGATC CAGCTGCCTA TTGATTTAAG 158 3 

CTTTCCTGTT GAATGACAAA GTATGTGGTT TTGTAATTTG TTTGAAACCC CAAACAGTGA 164 3 

CTGTACTTTC TATTTTAATC TTGCTACTAC CGTTATACAC ATATAGTGTA CAGCCAGACC 170 3 

AGATTAAACT TCATATGTAA TCTCTAGGAA GTCAATATGT GGAAGCAACC AAGCCTGCTG 17 C 3 

TCTTGTGATC ACTTAGCGAA CCCTTTATTT GAACAATGAA GTTGAAAATC ATAGGCACCT 182 3 

TTTACTGTGA TGTTTGTGTA TGTGGGAGTA CTCTCATCAC TACAGTATTA CTCTTACAAG 188 3 

AGTGGACTCA GTGGGTTAAC ATCAGTTTTG TTTACTCATC CTCCAGGAAC TGCAGGTCAA 194 3 

GTTGTCAGGT TATTTATTTT ATAATGTCCA TATGCTAATA GTGATCAAGA AG ACT TT AGG 200 3 

AATGGTTCTC TCAACAAGAA ATAATAGAAA TGTCTCAAGG CAGTTAATTC TCATTAATAC 2063 

TCTTTATCCT ATTTCTGGGG GAGGATGTAC GTGGCCATGT ATG AA GC CAA AT ATT AG GCT 212 3 

TAAAAACTGA AAAATCTGGT TCATTCTTCA GATATACTGG AACCCTTTTA AAGTTGATAT 218 3 

TGGGGCCATG AGTAAAATAG ATTTTATAAG ATGACTGTGT TGTACTAAAA TTCATCTGTC 224 3 

TATATTTTAT TTAGGGGACA TGGTTTGACT CATCTTATAT GGGAAACCAT GTAGCAGTGA 2 303 

GTCATATCTT AATATATTTC TAAATGTTTG GCATGTAAAC GTAAACTCAG CATCACAATA 2 363 

TTTCAGTGAA TTTGCACTGT TTAATCATAG TTACTGTGTA AACTCATCTG AAATGTTACC 242 3 

AAAAATAAAC TATAAAACAA AATTTGA 2450 

(2) INFORMATION FOR SEC; ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 358 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



Met Giy Asn Ala Ser Asm Asp Ser Gin Ser Glu Asp Cys Glu Thr Arg 

15 10 L5 

Gin Trp Phe Pro Pro Gly Glu Ser Pro Ala lie Ser Ser Val Met Phe 

20 25 30 

Ser Ala Gly Val Leu Gly Asn Leu lie Glu Leu Ala Leu Leu Ala Arg 



